Sketch-to-Text Generation: Toward Contextual, Creative, and Coherent Composition
نویسنده
چکیده
The need for natural language generation (NLG) arises in diverse, multimodal contexts: ranging from describing stories captured in a photograph, to instructing how to prepare a dish using a given set of ingredients, and to composing a sonnet for a given topic phrase. One common challenge among these types of NLG tasks is that the generation model often needs to work with relatively loose semantic correspondence between the input prompt and the desired output text. For example, an image caption that appeals to readers may require pragmatic interpretation of the scene beyond the literal content of the image. Similarly, composing a new recipe requires working out detailed how-to instructions that are not directly specified by the given set of ingredient names. In this talk, I will discuss our recent approaches to generating contextual, creative, and coherent text given a relatively lean and noisy input prompt with respect to three NLG tasks: (1) creative image captioning, (2) recipe composition, and (3) sonnet composition. A recurring theme is that our models learn most of the end-to-end mappings between the input and the output directly from data without requiring manual annotations for intermediate meaning representations. I will conclude the talk by discussing the strengths and the limitations of these types of data-driven approaches and point to avenues for future research.
منابع مشابه
The Impact of Contextual Clue Selection on Inference
Linguistic information can be conveyed in the form of speech and written text, but it is the content of the message that is ultimately essential for higher-level processes in language comprehension, such as making inferences and associations between text information and knowledge about the world. Linguistically, inference is the shovel that allows receivers to dig meaning out from the text with...
متن کاملAutomated Generation of Graphic Sketches by Example
Hand-crafting effective visual presentations is time-consuming and requires design skills. Here we present a case-based graphic sketch generation algorithm, which uses a database of existing graphic examples (cases) to automatically create a sketch of a presentation for a new user request. As the first case-based learning approach to graphics generation, our work offers three unique contributio...
متن کاملROBODANZA: Live Performances of a Creative Dancing Humanoid
The paper describes the artistic performances obtained with a creative system based on a cognitive architecture. The performances are executed by a humanoid robot whose creative behaviour is strongly influenced both by the interaction with human dancers and by internal and external evaluation mechanisms. The complexity of such a task requires the development of robust and fast algorithms in ord...
متن کاملThe role of creative economics in entrepreneurship and revenue generation of public libraries: A systematic review
Purpose: The present study was conducted to identify the status of research on entrepreneurship and income generation in public libraries with a focus on creative economics. Method: The present study is a systematic study. The statistical population of this study was all the researches done in the field of creative economy in public libraries that have been published in connection with entrepr...
متن کاملSketch-to-Image Generation Using Deep Contextual Completion
When the input to pix2pix translation [9] is a badly drawn sketch, the output follows the input edges due to the strict alignment imposed by the translation process. In this paper we propose sketch-to-image generation, where the output edges do not necessarily follow the input edges. We address the image generation problem using a novel joint image completion approach, where the sketch provides...
متن کامل